Preference-aware Integration of Temporal Data
نویسندگان
چکیده
A complete description of an entity is rarely contained in a single data source, but rather, it is often distributed across different data sources. Applications based on personal electronic health records, sentiment analysis, and financial records all illustrate that significant value can be derived from integrated, consistent, and queryable profiles of entities from different sources. Even more so, such integrated profiles are considerably enhanced if temporal information from different sources is carefully accounted for. We develop a simple and yet versatile operator, called PRAWN, that is typically called as a final step of an entity integration workflow. PRAWN is capable of consistently integrating and resolving temporal conflicts in data that may contain multiple dimensions of time based on a set of preference rules specified by a user (hence the name PRAWN for preference-aware union). In the event that not all conflicts can be resolved through preferences, one can enumerate each possible consistent interpretation of the result returned by PRAWN at a given time point through a polynomialdelay algorithm. In addition to providing algorithms for implementing PRAWN, we study and establish several desirable properties of PRAWN. First, PRAWN produces the same temporally integrated outcome, modulo representation of time, regardless of the order in which data sources are integrated. Second, PRAWN can be customized to integrate temporal data for different applications by specifying application-specific preference rules. Third, we show experimentally that our implementation of PRAWN is feasible on both “small” and “big” data platforms in that it is efficient in both storage and execution time. Finally, we demonstrate a fundamental advantage of PRAWN: we illustrate that standard query languages can be immediately used to pose useful temporal queries over the integrated and resolved entity repository.
منابع مشابه
Context-aware Modeling for Spatio-temporal Data Transmitted from a Wireless Body Sensor Network
Context-aware systems must be interoperable and work across different platforms at any time and in any place. Context data collected from wireless body area networks (WBAN) may be heterogeneous and imperfect, which makes their design and implementation difficult. In this research, we introduce a model which takes the dynamic nature of a context-aware system into consideration. This model is con...
متن کاملProper integration time of polarization signals of internetwork regions using Sunrise/IMaX data
Distribution of magnetic fields in the quiet-Sun internetwork areas has been affected by weak polarization (in particular Stokes Q and U) signals. To improve the signal-to-noise ratio (SNR) of the weak polarization signals, several approaches, including temporal integrations, have been proposed in the literature. In this study, we aim to investigate a proper temporal-integration time with which...
متن کاملA Novel Method for Measuring the Quality of Temporal Integration in Public Transport Systems
Temporal coordination of services, as a crucial aspect of integration in public transport systems, has always been a big concern for transit planners and schedulers. One of the major issues in the way of coordinating transit services is the lack of a robust measure of effectiveness for assessing the quality of temporal coordination in public transport systems. Even though the network-wide summa...
متن کاملCurriculum integration in Medical Sciences: Perspective of Faculty Members in Tabriz University of Medical sciences
Introduction: Curriculum integration is a newly emerged topic in medical education. Given the importance of integration in improving the quality of education, the aim of this study was to examine the extent and order of attention to steps of curriculum integration in Tabriz University of Medical Sciences. Methods: This descriptive-survey research was carried out in 2015 on a sample of 177 facu...
متن کاملEffects of range condition on the temporal diet selection by goats in steppe rangelands of Iran
One of the key factors in managing a rangeland is to determine the relative preference of its major plant species by thegrazing livestock. Preference value of each plant species is affected by plant type, companion plants, availability byanimals, phenological stage, climate condition, and the livestock species. We investigated the grazing behaviour of anative goat (Garizi) in the steppe rangela...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 8 شماره
صفحات -
تاریخ انتشار 2014